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Lactococci isolated from non-dairy sources have been found to possess enlianced 
metabolic activity when compared to dairy strains. These capabilities may be harnessed 
through the use of these strains as starter or adjunct cultures to produce more diverse 
flavor profiles in cheese and other dairy products. To understand the interactions between 
these organisms and the phages that infect them, a number of phages were isolated 
against lactococcal strains of non-dairy origin. One such phage, OL47, was isolated from 
a sewage sample using the grass isolate L. lactis ssp. cremoris DPC6860 as a host. 
Visualization of phage virions by transmission electron microscopy established that this 
phage belongs to the family Siphoviridae and possesses a long tail fiber, previously unseen 
in dairy lactococcal phages. Determination of the lytic spectrum revealed a broader than 
expected host range, with 4>L47 capable of infecting 4 industrial dairy strains, including 
MLS, HP and 310, and 3 additional non-dairy isolates. Whole genome sequencing of 4>L47 
revealed a dsDNA genome of 128, 546 bp, making it the largest sequenced lactococcal 
phage to date. In total, 190 open reading frames (ORFs) were identified, and comparative 
analysis revealed that the predicted products of 117 of these ORFs shared greater than 
50% amino acid identity with those of L. lactis phage 0949, a phage isolated from cheese 
whey. Despite their different ecological niches, the genomic content and organization 
of OL47 and 0949 are quite similar, with both containing 4 gene clusters oriented in 
different transcriptional directions. Other features that distinguish OL47 from 4)949 and 
other lactococcal phages, in addition to the presence of the tail fiber and the genome 
length, include a low GC content (32.5%) and a high number of predicted tRNA genes (8). 
Comparative genome analysis supports the conclusion that OL47 is a new member of the 
949 lactococcal phage group which currently includes the dairy <I>949. 
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INTRODUCTION 

Cultures of lactic acid bacteria (LAB) used in cheese manufac- 
ture play a pivotal role in the formation of cheese flavor (Urbach, 
1995). While the limited numbers of established dairy cultures in 
use have greatly reduced inconsistencies in cheese quality, it can 
be at the expense of cheese flavor. LAB isolated from non-dairy 
environments, such as plant material, often exhibit enhanced 
metabolic capabilities when compared to those of dairy origin 
and have been shown to contribute to a more diverse flavor pro- 
file in the cheese (Ayad et al, 1999, 2000; Morales et al, 2003). 
In addition to their role in flavor enhancement, cultures of non- 
dairy origin have also been shown to be naturally insensitive to 
bacteriophages which infect industrial strains (Ayad et al., 2000). 
Phage infection is the single largest cause of industrial milk fer- 
mentation problems, negatively impacting on the consistency of 
cheese quality and resulting in large economic losses (Coffey and 
Ross, 2002). Non-dairy lactococcal strains could potentially be 



exploited for use in cheese culture rotations as phage insensitive 
strains and may well reduce the negative consequences of phage 
infection within a production plant. However, given the high evo- 
lution rate of phages, it is highly likely that over time, phages 
would emerge to threaten these bacterial strains. 

All lactococcal phages isolated to date are members of 
the Caudovirales order which comprises of three families: 
Siphoviridae (long, non-contractile tails), Myoviridae (long, con- 
tractile tails) and Podoviridae (short tails) [for review see Veesler 
and Cambillau ( 20 11 ) ] . Phage classification systems have been 
utilized in the design of anti-phage strategies to prevent phages 
amplifying to high titers in manufacturing plants (Jarvis et al., 
1991). DNA hybridization assays and comparative genomic anal- 
ysis coupled with visualization of virions has led to the division of 
lactococcal phages into one of ten groups (Deveau et al., 2006). 
Of these phage groups, the c2, 936 and P335 groups are pre- 
dominantly associated with failed dairy fermentations and are 
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frequently isolated from cheese manufacturing plants (Moineau 
et al, 1992; Murphy et al, 2013). Currently within the NCBI 
database, there are complete genome sequences for over 30 lac- 
tococcal bacteriophages. Phages classified within the same group 
can share a large degree of genome similarity as observed within 
the c2 and 936 groups (Lubbers et al., 1995; Chopin et al., 2001; 
Crutz-Le Coq et al., 2002). However, within the P335 group, 
there exists a high proportion of genetic diversity attributed to 
the presence of both temperate and virulent bacteriophages clas- 
sified within this group (Labrie and Moineau, 2007; Garneau 
et al., 2008). Collectively, these groups account for the majority 
of sequenced lactococcal phage genomes, ranging in size from 
22 to 40 kb. Larger lactococcal phages do also exist, with <t'949 
at 114.8 kb being the largest sequenced lactococcal phage to date 
(Samson and Moineau, 2010). With the elucidation of the $949 
genome, a genomic sequence is available for at least one member 
of each phage group. 

The use of a wide range of different bacterial strains pos- 
sessing an industrially important phenotype has presumably 
contributed to the diversity of lactococcal phages. Genome 
analysis of the rarer uncommon lactococcal phages has shed 
further light on this diversity. Comparative analysis of the struc- 
tural proteins of <I)1358, show a high degree of similarity to 
2 Listeria monocytogenes phages. Moreover this phage was also 
found to possess an uncommonly high %GC content of 51 
(Dupuis and Moineau, 2010). Recent analysis of the <I)1706 
genome, revealed that 22 ORFs shared similarities with pro- 
teins of Firmicutes, typically found in the human gut (Garneau 
et al, 2008). The ability of this phage to infect lactococci 
was attributed to the acquisition of a 4 gene module, allow- 
ing for host recognition of lactococcal cells (Garneau et al., 
2008). Genomic analysis of the <I)KSY1 genome has suggested 
the exchange of genetic material between bacteria and phages 
from different environments (Chopin et al., 2007). <I>Q54 was 
found to possess a different modular configuration, thought to 
be derived from recombination events with 936 and c2 type 
phages (Fortier et al, 2006). It is probable that these rarer 
phages arose from recombination events with other lactococ- 
cal phages and phages infecting other Gram-positive bacteria 
(Dupuis and Moineau, 2010). Moreover, these phages appear to 
be less suited to thrive in milk fermentations in contrast to phages 
of the more common lactococcal phage groups which possess 
rapid reproduction rates believed to be driven by evolutionary 
pressure (Ferguson and Coombs, 2000; Dupuis and Moineau, 
2010). 

In this study we describe the isolation and characterization of 
<I>L47, a large lytic phage which infects the non-dairy isolate L. 
lactis ssp. cremoris DPC6860. We also report the complete genome 
sequence of <t>L47 which, to our knowledge, is the largest lacto- 
coccal phage reported to date. Due to the emergence of non-dairy 
lactococci as dairy cultures with enhanced flavor-forming activity, 
sufficient data needs to be generated with regards to these cultures 
and their phage resistance if they are to be successfully utilized 
in dairy processing. Therefore, the objective of this study was to 
provide a better understanding of phages of lactococci from non- 
dairy origins at a phenotypic and genomic level, thus offering 
further insight into phage-host interactions. 



MATERIALS AND METHODS 

BACTERIAL STRAINS, BACTERIOPHAGE AND CULTURE CONDITIONS 

Dairy and non-dairy Lactococcus strains were supplied by the 
TFRC-Moorepark culture collection (Teagasc Food Research 
Centre, Moorepark, Ireland). L. lactis ssp. cremoris DPC6860 was 
previously isolated from grass (D. Cavanagh, unpublished) and 
characterized by 16S rRNA analysis as described by Alander et al. 
(1999). All lactococcal strains used in this study were cultured 
in M17 (Oxoid, Hampshire, England) media containing 0.5% 
lactose (wt./vol.) (VWR, Leuven, Belgium) (LM17) at 30°C for 
18-24 h under aerobic conditions. Double-strength M17 broth, 
used for phage enrichment of sewage samples, was prepared by 
doubling the amount of dry Ml 7 media and reconstituting it 
in the same amount of distilled water as the IX media with 
the addition of 1% lactose (wt./vol). Soft agar overlays and 
solid agar medium contained 0.75% and 1.5% agar, respectively. 
Lactococcal phages OKSYl, Oc2, ObIL170 and OP008 were 
originally obtained from the Felix d'Herelle Reference Center 
for Bacterial Viruses (GREB, Pavilion de Medecine Dentaire 
Universite Laval, QC, Canada) while phages <t>712, <I>HP, Oebl 
and <I>ML3 were obtained from the UCC culture collection 
(University College Cork, Cork, Ireland). Phages were propagated 
using their respective hosts at 30° C in Ml 7 media containing 
0.5% glucose (wt./vol.) (Sigma-Aldrich, Dublin, Ireland) and 
10 mM CaCl2. 

SPOT ASSAY 

Bacterial infection by phages was assessed using spot plate assays 
with phage titers of 10^ PFU/mL. Briefly, lOjxL of phage lysate 
was spotted onto an LM17 soft agar overlay containing 10 mM 
CaCl2 and inoculated with 1 x 10^ CFU/mL of the host organ- 
ism. Spot plates were allowed to dry before incubation aerobically, 
at30°Cfor 24 h. 

PHAGE ISOLATION AND PROPAGATION 

Raw sewage samples were collected from the sewage treatment 
facility in Mitchelstown, Co. Cork, Ireland. Phage isolation was 
conducted as described previously (Alemayehu et al., 2009). 
Individual plaques isolated following this method underwent 
3 successive rounds of plaque purification, to ensure that a 
pure phage was isolated. Briefly, a single plaque was aseptically 
removed from an overlay plate using a sterile 1 mL pipette tip and 
added to 5mL of mid-log phase host, containing 10 mM CaCl2. 
Following overnight incubation at 30° C, the mixture was cen- 
trifuged at 4,500 rpm for 15 min and filtered through a 0.45 [xm 
pore filter (Sarstedt, Wexford, Ireland). The filtrate was diluted 
to 10^^ and plaqued on the appropriate media for 24 h at 30°C. 
These steps were repeated twice on the resulting plaques until a 
pure phage was obtained. 

LYTIC SPECTRUM AND ADSORPTION TO LACTOCOCCAL CELLS 

The lytic spectrum of <I>L47 was determined, as for dairy lacto- 
coccal phages, by spotting 10 |i,L of phage lysate containing 10'' 
PFU/mL onto soft agar seeded with a L. lactis strain. Adsorption 
of phage particles to lactococcal cells was determined as fol- 
lows: lOjxL of phage lysate (~10'' PFU/mL) was mixed with 
2 mL of late exponential cells. Samples were incubated at 30°C 
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for lOmin with shaking, to allow the phage particles to attach. 
Each sample was centrifuged at 14,000 rpm for 5 min and fil- 
tered through a 0.2 (jum filter (Sarstedt, Wexford, Ireland). The 
number of phages in the supernatant was determined by plaque 
assay and % adsorption was calculated using the formula [(initial 
titer — titer in supernatant)/ initial titer x 100%]. 

ELECTRON MICROSCOPY 

High titer phage suspensions for visualization of phage parti- 
cles were prepared using CsCl gradient purification and ammo- 
nium acetate concentration. Pure phage samples were obtained 
using a CsCl gradient of polyethylene glycol ( Sigma- Aldrich, 
Dublin, Ireland) precipitates as described by Sambrook and 
Russell (2001). For ammonium acetate precipitation, 1 L of fresh 
lysate was centrifuged at 10,000 rpm at 4°C for 10 min and filter 
sterilized using a 0.45 |im pore filter (Sarstedt). Phage particles 
were subsequently precipitated by centrifugation at 20,000 rpm 
for 1 h at 4°C. The supernatant was removed and the phage pel- 
let re-suspended in 10 mL of ice cold 0.1 M ammonium acetate 
(Sigma- Aldrich). After pooling of phage samples, a further 10 mL 
of ammonium acetate was added and the sample centrifuged 
at 20,000 rpm for 1 h. The supernatant was again removed and 
the pellet suspended in a final volume of 1 mL ammonium 
acetate. Negative staining was performed on both phage sam- 
ples using 2% uranyl acetate on carbon films. Each grid was 
examined at an 80 kV acceleration voltage using a Tecnai 10 
transmission electron microscope (FEI Company, Eindhoven, 
The Netherlands). Micrograph images were captured using a 
MegaView 2 CCD-camera (Olympus SIS, Miinster, Germany). 
Phage structure dimensions were determined based on the aver- 
age of 10-15 measurements. 

STRUCTURAL ANALYSIS OF PHAGE PROTEINS 

Analysis of structural proteins was performed as described pre- 
viously (Kelly et al., 2012) using high titer phage suspensions 
obtained from ammonium acetate concentration. Samples were 
mixed with 4X sample loading buffer and heated at 95° C prior 
to loading in a 12% SDS polyacrylamide gel. Protein bands were 
stained using Coomassie blue staining and excess dye removed 
using a de-staining solution (40% ethanol, 53% distilled water 
and 7% acetic acid). Protein size was estimated using a broad 
range protein ladder (New England Biolabs, Hertfordshire, UK) 
as a relative molecular weight marker (MWM). 

DNA SEQUENCING, ANNOTATION AND COMPARATIVE ANALYSIS 

Bacteriophage DNA sequencing was performed using the Roche GS 
FLX-I- system to >20 x coverage (MWG, Ebersberg, Germany). The 
quality of the raw assembly reads were visualized and verified using 
the programme Hawkeye (Amos) (Schatz et al., 2013) and Consed 
(Gordon, 2003). To verify the genome structure, PCR ampli- 
cons were generated with the Platinum Hi- fidelity PCR Supermix 
(Invitrogen, Life Technologies, Dublin, Ireland) for various regions 
of the genome and at contig ends, followed by direct sequencing. 
The final phage genome was assembled using the Phred-Phrap- 
Consed package (Ewing and Green, 1998; Gordon, 2003). ORFs 
were predicted using the programs prodigal (Hyatt et al., 20 10) and 
Glimmer (Delcher et al, 1999). Annotation was providedby the 



RAST annotation software (Aziz et al, 2008) and GAMOLA 
(Altermann and Klaenhammer, 2003). Genome annotation 
was verified manually using Artemis (http://www.sanger.ac.uk/ 
resources/software/artemis/) (Rutherford etal, 2000) and detected 
open reading frames (ORFs) were functionally annotated using 
BLASTp (Altschul et al., 1990). Conserved domains were detected 
using InterProScan (http://www.ebi.ac.uk/InterProScan/) and 
DELTA-BLAST (Domain Enhanced Lookup Time Accelerated 
BLAST) (Marchler-Bauer et al, 20II). Identification of tRNAs 
was achieved using the software packages tRNA scan-SE (Lowe 
and Eddy, 1997) and ARAGORN vl.2.36 (Lowe and Eddy, 1997; 
Laslett and Canback, 2004). Comparative genome analysis of the 
Lactococcus <I)L47 with its most similar relative, Lactococcus <I>949, 
was performed using the Artemis Comparison Tool (ACT) pro- 
gramme (Carver et al., 2005). The genome sequence of <t>L47 
is available from GenBank/EMBL under the accession number 
KF926093. 

RESULTS AND DISCUSSION 
ISOLATION OF «L47 FROM SEWAGE 

Previous work in our laboratory led to the isolation of a bank of 
lactococcal strains from non-dairy origins, including grass, veg- 
etable matter and bovine rumen samples. These strains, which 
included both L. lactis ssp. lactis and L. lactis ssp. cremoris, were 
evaluated for their flavor-forming ability in a mini-Gouda cheese 
model and were shown to have potential to diversify flavor in this 
system (D. Cavanagh, unpublished). Future use of these strains in 
a commercial setting will depend on a number of technological 
characteristics inherent in the strains, including their sensitivity 
to phages. The phage sensitivity of the non-dairy isolate bank 
was assessed using 8 common dairy lactococcal phages. Phage 
titers in excess of 10'' PFU/mL were propagated and tested against 
the target strain using a spot plate assay performed in triplicate 
(Table 1). «>P008, 4)bIL170, $HP, t>C2 and «>KSY1 were inca- 
pable of infecting all of the non-dairy isolates tested. However, 
$ML3 was capable of infecting strains DPC6855 and DPC6856, 
isolated from grass and bovine rumen, respectively, while <I>ebI 
infected the strains DPC6854 and DPC6855 but not DPC6856 
(Table 1). 

Overall, the non-dairy isolates displayed significant levels 
of insensitivity to the dairy phages. Given the high evolution 
rate of phages to overcome bacterial defence systems (Sturino 
and Klaenhammer, 2006), however, it is highly likely that 
over time, phages would emerge to threaten these bacterial 
strains. To understand the interaction of these isolates with 
the phages that infect them, a screening programme was ini- 
tiated to identify phages specific for some of the non-dairy 
lactococci. Following 3 enrichment cycles of raw sewage sam- 
ples with the grass isolate L. lactis DPC6860 as a host, phage 
<I>L47 was isolated. <t>L47 was found to form clear plaques of 
~0.6 mm diameter when plaqued on L. lactis DPC6860. The for- 
mation of clear plaques would suggest that <I>L47 is virulent. 
Phage titers of 10^-10^ PFU/mL were recovered after a single 
propagation. 

LIMITED HOST RANGE OF L. lactis DPC6860 

In order to determine the host range of 't'L47, a range of lacto- 
coccal strains were chosen for analysis including industrial dairy 
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Table 1 | Phage resistance profile of dairy lactococcal phages to non-dairy Lactococcus strains. 



Strain Phage 





Phage group 
Origin 




936 






c2 




KSY1 
$KSY1 


*712 


$P008 $blL170 


*HP 


$C2 


«ebl 


$ML3 


L. lactis ssp. lactis DPC6853 


Corn 
















L. lactis ssp. cremoris DPC6854 


Grass 










+ 






L. lactis ssp. cremoris DPC6855 


Grass 










+ 


+ 




L. lactis ssp. cremoris DPC6856 


Rumen 


+ 










+ 




L. lactis ssp. cremoris DPC6857 


Grass 
















L. lactis ssp. cremoris DPC6858 


Grass 
















L. lactis ssp. cremoris DPC6859 


Grass 
















L. lactis ssp. cremoris DPC6860 


Grass 

















Starters, strains of non-dairy origins and a strain from raw milk 
(Table 2). It was found that <1>L47 is capable of infecting 3 out 
of 7 non-dairy Lactococcus isolates but only 4 out of the 19 
dairy strains, corresponding to hosts of 2 phage groups includ- 
ing the 949 group (Table 2). OL47 was unable to infect L. lactis 
IL1403, another host strain of 0949. Contrastingly, although 
<I>949 was only able to infect 7 of 59 strains of Lactococcus, these 
corresponded to hosts belonging to 5 phage groups (Samson 
and Moineau, 2010). Levels of adsorption of <t>L47 to L. lac- 
tis DPC6860 were >85% compared to <40% for other strains 
tested (Figure 1). However, <1>L47 was capable of lysing strain 
MLS but not 303 even though a higher level of phage adsorp- 
tion was observed in the latter. This would suggest that L. lactis 
303 possesses a form of anti-phage defence which prevents phage 
DNA entry or cleaves phage DNA. 

EXTENDED TAIL FIBER OF $L47 

Visualization of phage particles was achieved by transmission 
electron microscopy performed on samples obtained from both 
CsCl gradient purification of PEG precipitations and ammonium 
acetate concentration. Images generated from samples obtained 
by CsCl gradient showed the majority of phage virions to have 
disintegrated, with broken tails and empty capsids. Images gener- 
ated from ammonium acetate preparations displayed intact phage 
particles which allowed for the estimation of capsid diameter 
and tail length (Figure 2). It was determined that <I>L47 possesses 
an icosahedral capsid (diameter 75 nm) and a non-contractile 
tail (length 480 nm; width 15 nm) indicating that this phage is 
a member of the family Siphoviridae of the class Caudovirales. 
Morphologically, <I>L47 is very similar to the other lactococcal 
phage, <t'949 which possesses a non-contractile tail of 500 nm in 
length with an icosahedral capsid of 79 nm in diameter (Jarvis, 
1984; Deveau et al, 2006; Samson and Moineau, 2010). However, 
OL47 appears to possess a distinctive tail fiber of 280 nm in 
length and tail fiber width of <10nm (Figure 2B). Siphoviridae 
phages frequently possess a tail fiber involved in phage infec- 
tion, however, a tail fiber of this length has not previously been 
reported in lactococcal phages (Vegge et al., 2005; Boulanger et al., 
2008; lakutyte et al., 2012). Enterococcus faecalis bacteriophage 
OEF24C-P2 was found to possess a long tail fiber which was asso- 
ciated with higher infectivity against bacterial strains than the 



Table 2 | Host range of Lactococcus phage $L47. 



Species 




Orlgln/Use 


Host celllysis 


L. lactis ssp. 


cremoris DPC6855 


Grass 




L. lactis ssp. 


cremoris DPC6859 


Grass 


-l- 


L. lactis ssp. 


cremoris DPC6854 


Grass 




L. lactis ssp. 


cremoris DPC6858 


Grass 




L. lactis ssp. 


cremoris DPC6857 


Grass 


+ 


L. lactis ssp. 


cremoris DPC6856 


Rumen 




L. lactis ssp. 


lactis DPC6853 


Corn 




L. lactis ssp. 


cremoris H P 


Dairy 


+ 


L. lactis ssp. 


cremoris IE16 


Dairy 




L. lactis ssp. 


cremoris MG1363 


Dairy 




L. lactis ssp. 


/actelL1403 


Dairy 




L lactis SMQ562 


Dairy 




L. lactis ssp. 


cremoris UC509 


Dairy 




L. lactis ssp. 


lactis biovar diacetylactis F7/2 


Dairy 




L. lactis ssp. lactis MLB 


Cheese 


+ 


L. lactis ssp. cremoris H88M1 


Raw milk 




L. lactis ssp. lactis 303 


Cheese 




L lactis HT- 


2 


Dairy 




L. lactis ssp. 


cremoris SK11 


Cheese 




L. lactis U53 




Dairy 


+ 


L. lactis ssp. lactis biovar diacetylactis DRC3 


Dairy 




L. lactis 83 




Dairy 




L. lactis BA1 




Dairy 




L. lactis ssp. lactis biovar diacetylactis 938 


Dairy 




L. lactis ssp. 


cremoris 310 


Cheese 


+ 


L. lactis ssp. lactis 229 


Cheese 




Lactobacillus plantarum 


Grass 




Lactobacillus brevis 


Grass 





un-mutated phage which does not have a tail fiber (Uchiyama 
et al., 2011). Similar to <I)EF24C-P2, an extended tail fiber was 
found in Bacteroides fragilis phage ATCC 51477-BI, which was 
isolated from wastewater (Hawkins et al., 2008). No bouquet-like 
arrangement was identified in <I>L47, as demonstrated by phage 
<t>Y of Bacillus anthracis which possessed a 63 nm long tail fiber 
(Schuch and Fischetti, 2006). 
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FIGURE 1 I Relative % adsorption of 4>L47 to Lactococcus strains of 
different environmental niches and subspecies designation. 
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FIGURE 2 I (A) Transmission electron micrograpli image of 4>L47 
negatively stained using 2% uranyl acetate. (B) Single view of <I>L47. Arrow 
shows the elongated tail fiber. 



STRUCTURAL PROTEIN ANALYSIS 

SDS-PAGE was employed to isolate the structural proteins of 
OL47 and this led to the identification of 9 individual pro- 
tein bands (Figure 3). Five proteins of low molecular weight 
(<30kDa) identified from genome sequence analysis, were not 
identified on the gel. These proteins may not have been present 
at a sufficient concentration to allow for visualization. Molecular 
masses of proteins estimated by SDS-PAGE corresponded to 
the estimated molecular weight of structural proteins predicted 
from genome sequencing. Three major structural proteins were 
identified from SDS-PAGE, with the largest having a molecular 
weight of 31.4kDa and may correspond to a base plate protein 
(ORF 176). One large minor protein of ~100kDa was visual- 
ized on the gel but did not correspond to any identified ORE 
Multiple bands were observed at ~50 kDa and ~30 kDa which 
could not be assigned to identified ORFs. It is possible that these 
proteins are proteolytic products of some other phage struc- 
tural proteins, or they are of bacterial origin. Techniques such as 
N-terminal sequencing of protein bands would allow for a more 
definite relationship to be established between genome analyses 
and SDS-PAGE. 



ANALYSIS OF THE L47 GENOME 
GENERAL CHARACTERISTICS 

To date, all 59 sequenced L. lactis phages available in the pub- 
lic domain possess a dsDNA genome and are predominately 
15 to 35 kb in size with the exception of the larger genomes 
of 4)KSY1 (79.2kb), $P087 (60.1 kb), and $949 (114.8 kb). 
Analysis of the <I>L47 genome revealed a dsDNA molecule of 
128, 932 bp, making it the largest sequenced lactococcal phage 
to date. <I>L47 was found to possess a molecular GC content 
of 32.5% which corresponds to that observed in the similarly 
sized Lactococcus phage, $949 (32.7%). The GC content of the 
host organism, i. lactis DPC6860, was calculated as 35.56%. 
A total of 190 ORFs were identified in the <I>L47 genome, 
117 of which shared significant homology with I. lactis <I)949 
(>50% amino acid identity). The majority of these genes were 
involved in capsid and tail morphogenesis as well as various 
hypothetical proteins. <t>949 was isolated from cheese whey 
in New Zealand over 35 years ago while <1>L47 was isolated 
from raw sewage indicating how this phage group thrives in 
their respective environments. Five ORFs were identified that 
shared no significant identity with proteins in the databases. 
Similar to the unusual genomic arrangement of <I>949, 4 gene 
clusters were identified in the <1>L47 genome, with 2 clusters 
transcribed in the opposite direction. Large inter-genic regions 
were identified between ORFs 37-38 and ORFs 54-55 which 
were preceded by a shift in the direction of gene transcrip- 
tion. The 190 ORFs identified in the OL47 genome (Table 3) 
account for 88% of the genome which is the same as that iden- 
tified in 0949. No putative function could be assigned to 132 
ORFs (69.47%). The predominant start codon (89.71%) was 
ATG with only 8 ORFs starting with an uncommon start codon 
(TTG, GTG). 

TAIL MORPHOLOGY 

As stated previously, <t>L47 possessed a long non-contractUe tail 
of 480 nm in addition to a long tail fiber of 280 nm. Both cap- 
sid and tail structural genes were encoded from ORFs 157 to 
182, and organized in a modular arrangement located down- 
stream from the holin gene (Figure 4). The products of 14 of 
these ORFs encoded putative structural proteins sharing >70% 
amino acid identity with <I)949 (Table 3). This is not unusual 
as the major structural proteins are generally somewhat con- 
served in related phages (Suyama and Bork, 2001; Ceyssens et al., 
2011). In contrast, proteins involved in adsorption to host cells, 
such as tail fibers, are expected to differ to a larger degree as 
they are modified to complement with surface receptors of bac- 
terial hosts (Sandmeier et al, 1992; HatfuU, 2002; Desplats and 
Krisch, 2003; Silhavy et al, 2010). ORFs 160-163, located within 
the structural module, encode proteins sharing a high percent- 
age identity (>60%) to proteins of ObIL286. ORF 159 encodes 
a protein with a high percentage identity to a putative tail pro- 
tein of Lactococcus 0949 but also the putative base plate protein 
of Lactococcus <I)P335, and the phage tail assembly protein of 
L. lactis ssp. cremoris A76. A conserved domain search of the 
putative tail protein of <t>949 reveals the presence of a prophage 
tail super family cll2123 domain which may possibly act as an 
endopeptidase. 
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14.3 — • i Structural protein 18.7 168 




Structural protein 17,5 158 

Structural protein 16.9 177 



FIGURE 3 I Analysis of the structural proteins of *L47 obtained (MWM). Molecular masses of structural proteins was predicted from 

from ammonium acetate precipitations. A broad range ladder (New genome sequence data using the Compute pi/ Mw tool (Gasteiger 
England Biolabs) was used as a relative molecular weight marker et al., 2005). 



ORF 160 encodes a putative host-specificity protein sharing a 
60% identity at the amino acid level with that of the prophage 
<I>bIL286. This ORF also shares identity to the phage receptor 
binding protein (35%) and tail host specificity protein (36%) of 
the plant strain L. lactis KF147. ORFs 162 and 163 also indicate 
homology with ORF 55 of $bIL286, both with 89 and 94% iden- 
tity, respectively. No conserved domains were identified within 
these two proteins; however, using BLASTp, both proteins were 
found to possess some identity to a putative tail fiber of Bacillus 
<I>phiNItl. In order to assign a putative function to these ORFs 
of <1)L47, ORF 55 of ObIL286 was analyzed by InterProScan. The 
C-terminal region of this gene product was predicted to have an 
immunoglobulin-like domain IPR007110 as found in the major 
tail subunit of Enterobacteria <I>HK97 (Juhala et al, 2000) and 
other phages. Immunoglobulin-like domains participate in var- 
ious functions, including cell-cell recognition and cell-surface 
receptors. Therefore, it is likely that these genes also form part 
of the $L47 tail. 

HOST CELL LYSIS 

In dsDNA phages, the combined action of the holin-lysin genes 
function in the release of new phage particles from an infected 
ceU (Daniel et al, 2007). ORF 69 was found to encode a 
putative endolysin sharing 89% identity with that of <1>949, 
which possesses an amidase_2 domain pfam01510. This ORF 
also shows a high degree of similarity with gp073, sequence 
ID YP_001469072.1, of the temperate lactococcal phage 4)KSY1 
(83% identity). No holin gene was identified in the same locus 
as the endolysin, however, a putative holin was located remotely 
from the endolysin (Figure 4). ORF 154 exhibits an 87% identity 
to the putative phage holin from 0949, containing a holin LLH 



superfamily cl09890 domain, a conserved domain of ~ 100 amino 
acids found in prophage and phage of Gram-positive bacteria. 
The arrangement of the holin and endolysin genes in the <I>L47 
genome is not typical of lactococcal phages but a similar organi- 
zation has been reported in <I>949, 4>P087, and *I>1708 (Garneau 
et al, 2008; Villion et al, 2009). 

DNA REPLICATION AND NUCLEOTIDE BIOSYNTHESIS 

The DNA replication module was located downstream from the 
endolysin gene and upstream from the holin gene. The pre- 
dicted gene products for ORFs 75 (Helicase), 76 (Primase), 
66 (Nucleotidase), 110 and 111 (DNA gyrase) and 114 
(Deoxyribosyltransferase) displayed a large degree of homology 
with 0949. ORF 89 was identified as a putative replication protein 
which shares 84% identity with the putative replication initiator 
protein of L. lactis ssp. lactis lO-l (Kato et al, 2012). A DHH 
domain pfam01368 was identified in ORF 77 which includes 
the single-stranded DNA exonuclease RecJ. The protein encoded 
by this ORF exhibits a 95% identity with the single-stranded 
DNA exonuclease of Lactococcus <I)C13, Lactococcus OCaesusJMl 
and Lactococcus 0949. An exonuclease functions to catalyse the 
cleavage of a single nucleotide from the end of a polynucleotide 
chain and is involved in DNA repair, recombination and replica- 
tion (Ceska and Sayers, 1998). No putative endonucleases were 
identified in the L47 genome, in contrast to two distinct HNH 
endonucleases in <I>949. Structure specific endonucleases cut at 
particular DNA structures and in some cases can give a com- 
petitive advantage to a given bacteriophage in a mixed infection 
(Goodrich-Blair and Shub, 1996). It is possible that 4)L47 pos- 
sesses no HNH endonucleases as it has acquired other charac- 
teristics, such as possessing a reduced number of endonuclease 
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ORF 


Start 


Stop 


F/R 


IVIWlkDa) 


Best match 


%ID 


E-value 


Accession number"^ 


1 


146 


1165 


F 


39.0 


Putative integrase-recombinase [Lactococcus 


324/339 (96) 


0.0 


YP_004306161.1 












*9491 








2 


1174 


3495 


F 


87.6 


Putative exodeoxyribonuclease [Lactococcus 


707/772 (92) 


0.0 


YP_004306162.1 












tl>9491 








4 


4260 


4949 


F 


24.0 


Putative transglycosylase [Lactococcus 4>9491 


195/231 (84) 


4e-132 


YP_004306164.1 


5 


4963 


5517 


F 


21.4 


Putative undnaracterized protein [Firmicutes 


29/89 (32) 




CDD64083.1^^M 












bacterium CAG:882| flHHH^^^B 








12 


7633 


8229 


F 


22.7 


Putative thymidine kinase {Lactococcus <t>9491 


184/196 (94) 


3e-134 


YP_004306171.1 


14 


8674 


8817 


F 


5.55 


Hypothetical protein, unlikely [Trypanosoma 


16/33 (48) 


5.8 


CCD16863.1 ■ 












congolense 1 L3000] ^^g^^^ 






■ 


15 


8841 


8951 


F 


5.46 


Transglycosylase-like domain protein 


18/44 (41) 


0.32 


WP_008087686.1 












[Streptococcus ictaluri] 








23 


10569 


10796 


F 


8.43 


Phage protein [L. lactis ssp. lactis KF147] 


35/64 (55) 


5e-18 


YP_003353508.1 


24 


10814 


11191 


F 


14.0 


Hypothetical protein PelgB_21892 [Paenibacillus 


61/122 (50) 


1e-33 


ZP_09077 133.1 












elgii B69] 








26 


11447 


11770 


F 


12.4 


Hypothetical protein LACR_2119 [L. lactis ssp. 


81/120 (68) 


4e-46 


YP_8117071 












cremoris SK1 11 








27 


11770 


12045 


F 


10.1 


Hypothetical protein ul36_25 [Lactococcus <t>ul36] 


50/84 (60) 


3e-20 


NP_663659.1 


28 


12038 


12367 


F 


12.5 


ORF23 [Lactococcus *TP901-11 


39/89 (44) 


9e-8 


NP_1 12686 


29 


12364 


12609 


F 


88.9 


Hypothetical protein Q33_0026 [Lactococcus 


77/81 (95) 


3e-50 


AFV51055 












tl>Q33] 








30 


12644 


12835 


F 


7.07 


Hypothetical protein [Lactococcus 't>p2] 


38/d4 (59) 


1 e-14 


AL 19489 /.I 


31 


12912 


13415 


F 


19.8 


Hypothetical protein [Limnohabitans sp. Rim47] 


20/93 (22) 


0.32 


WP_01 9430841.1 


32 


13415 


13630 


F 


8.27 


Hypothetical protein [Methyloversatilis universalis] 


15/48 (31) 


2.0 


WP_01 8228448.1^ 


35 


14561 


14914 


F 


13.9 


Orf28 [Lactococcus tl)blL286) 


35/108 (32) 


0.37 


NP_076662.1 


37 


15241 


15360 


F 


4.67 


Hypothetical protein [L. lactis ssp. lactis CV561 


24/35 (69) 


1e-06 


YP_005869010.1 


39 


17948 


17772 


R 


6.99 


Unknown 








40 


18472 


18083 


R 


15.0 


Hypothetical protein [Enterococcus OSAP6] 


46/118 (39) 


6e-11 


AEM24750.1 


44 


19772 


19539 


R 


9.23 


Hypothetical protein ACD_5C00356G0007 


18/61 (30) 


1.3 


EKE24920.1 












[uncultured bacterium] 






■ 


45 


20124 


19852 


R 


10.9 


Hypothetical protein ECED1_3531 [Esctierichia coli 


24/69 (35) 


0.002 


YP_002399391.1 












EDIa] 








46 


20281 


20141 


R 


5.35 


Unknown 








47 


20856 


20716 


R 


4.52 


Unknown 








48 


21121 


20867 


R 


9.73 


Hypothetical protein EFP_gp119 [Enterococcus 


45/79 (57) 


1e-19 


YP_001 504228.1 












<I>phiEF24C] 








49 


21414 


21178 


R 


9.60 


Hypothetical protein [Enterococcus faecium] 


38/74 (51) 


2e-14 


WP_002350596.1 


52 


22318 


22007 


R 


11.7 


Hypothetical protein EFP_gp116 [Enterococcus 


29/ 72 (40) 


4e-12 


YP_001 504225.1 












<I>phiEF24C] 








53 


22680 


22411 


R 


10.4 


Hypothetical protein LmalK35_10926 [Lactobacillus 


25/ 75 (33) 


5e-04 


ZP_09449152.1 












mali KCTC 3596] 








54 


23092 


22751 


R 


13.3 


LamB/YcsF family protein [IVIesoflavibacter 


28/ 122 (23) 


1.6 


ZP 09499326.1 ■ 












zeaxanthinifaciens S86] 








55 


25273 


25410 


F 


5.59 


Hypothetical protein ARALYDRAFT_900440 


16/40 (40) 


0.63 


XP_002886300.1 












[Arabidopsis lyrata] 








57 


25619 


25768 


F 


5.61 


Zinc finger protein 263 [Monodeiphis domestica] 


17/42 (40) 


0.34 


XP_001 370637 2 


58 


25765 


26346 


r 


22.6 


Hypothetical protein [Streptococcus henryi] 


yu/ lo/ l4oJ 


2e-44 


\A/D moicnco/i 1 
VVr_u 1 o 1 bbbz4. 1 


60 


26593 


26979 


F 


15.8 


Hypothetical protein CLJ_0158 [C. botulinum Ba4 


39/130 (30) 


5e-07 


YP_002860465.1 












str. 657] 








61 


27057 


27260 


F 


7.92 


Hypothetical protein CAEBREN_29888 


19/53 (36) 


0.45 


EGT53499.1 












[Caenorhabditis brenneri] 








62 


27257 


27691 


F 


16.8 


Hypothetical protein EfaCPT1_gp44 [Enterococcus 


41/139 (29) 


6e-04 


AFO10841.1 












tUEfaCPTI] 
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ORF 


Start 


Stop 


F/R 


MW(kDa) 


Best match 


%ID 


£-value 


Accession number*^ 


64 


28027 


28230 


F 


7.90 


Hypothetical protein LPV712_gp041 [Lactococcus 


34/66 (52) 


4e-13 


YP_764301.1 












*7121 








66 


28820 


29419 


F 


23.2 


Putative nucleotidase {Lactococcus *9491 


190/199 (95) 


5e-136 


YP_004306212.1 


69 


30662 


31708 


F 


37.8 


Putative endolysin [Lactococcus <1)949] 


311/348 (89) 


0.0 


YP_001 469072.1 


72 


33459 


33629 


F 


6.35 


Hypothetical protein xccb100_2160 IXanthomonas 


15/53 (28) 


1.5 


YP_001 903564.1 












campestris pv. campestris str. B1001 








75 


35411 


36955 


F 


59.2 


Putative replicative DNA helicase [Lactococcus 


481/514 (94) 


0.0 


YP_004306220.1 












<S>949] 








76 


36966 


38015 


F 


40.5 


Putative DNA primase [Lactococcus <1)949] 


302/329 (92) 


0.0 


YP_004306221.1 


77 


38022 


39806 


F 


68.0 


Putative single-stranded DNA specific exonuclease 


566/593 (95) 


0.0 


ACU46843.1 












[Lactococcus <I>CB131 








80 


40336 


40815 


F 


18.2 


Membrane protein containing HD superfamily 


27/71 (38) 


1.1 


CCC58155.1^^H 












hydrolase domain, YQFF ortholog [Caloramator 


















australicus RC3] 








82 


41311 


41496 


F 


7.74 


Unknown 








83 


41493 


41636 


F 


4.99 


Unknown 








84 


41639 


41854 


F 


8.84 


GJ 15471 [Drosophila virilis] 


20/54 (37) 


0.34 


XP_0020600571 


85 


41848 


42081 


F 


9.16 


Conserved hypothetical protein [Anaerococcus 


16/54 (30) 


1.3 


ZP_03930743.1 V 












tetradius ATCC 35098] 






m 


86 


42081 


42458 


F 


14.4 


GA28089 [Drosophila pseudoobscura 


21/54 (39) 


0.44 


XP_002133330.1 












pseudoobscura] 








88 


42897 


43262 


F 


14.4 


Putative phage protein [L. lactis ssp. lactis AH] 


60/99 (61) 


1e-24 


CDG04128.1 


89 


43259 


43648 


F 


14.6 


Putative replication initiator protein [L. lactis ssp. 


108/129 (84) 


3-75 


YP_007508915.1 












Lactis 10-1] 








90 


43695 


44000 


F 


12.2 


Hypothetical protein [L. lactis ssp. lactis CV56] 


36/105 (34) 


6e-07 


YP_005868381.1 


92 


44413 


44592 


F 


7.14 


Catechol 1,2-dioxygenase [Pseudomonas stutzeri 


23/53 (43) 


3.3 


AFN77848.1 ■ 












DSM 10701] 






1 


100 


46959 


47195 


F 


8.87 


Related to Beta-mannosidase precursor 


15/41 (37) 


4.6 


CBQ72430.1 ■ 












[Sporisorium reilianum SRZ2] ^^^^^ 








105 


49552 


49986 


F 


16.8 


Hypothetical protein ROI_04910 [Roseburia 


21/65 (32) 


0.058 














intestinalis M50/1] 








110 


55222 


57189 


F 


73.4 


Putative DNA gyrase subunit B-topoisomerase 


629/651 (97) 


0.0 


■ . ■ . 












[Lactococcus <1>949] 








111 


57191 


59305 


F 


79.22 


Putative DNA gyrase subunit A-topoisomerase 


675/704 (96) 


0.0 


YP_004306255.1 












[Lactococcus <1>949] 








114 


59918 


60436 


F 


19.0 


Putative nucleoside-2-deoxyribosyltransferase 


166/172 (97) 


1e-114 


YP_004306259.1 












[Lactococcus <1>949] 








115 


60429 


61088 


F 


24.9 


Putative nicotinamide mononucleotide transporter 


205/219 (94) 


3e-142 


YP_004306260.1 












[Lactococcus <1>949] 








116 


61168 


63669 


F 


95.7 


Putative anaerobic ribonucleoside-triphosphate 


268/791 (34) 


2e-134 


YP_007237286.1 












reductase [Clostridium <I>phi8074-B1] 








117 


63669 


64265 


F 


22.7 


Putative ribonucleoside triphosphate reductase 


167/198 (84) 


2e-126 


YP_004306263.1 












activator small subunit [Lactococcus $949] 








118 


64356 


65258 


F 


35.4 


Hypothetical group 1 intron protein [Lactococcus 


166/238 (70) 


8e-111 


YP_004306264.1 












<S>9A9] 








119 


65327 


66550 


F 


45.8 


gp109 [Lactococcus *KSY1] 


151/389 (39) 


1e-69 


YP_001469108.1 


122 


67083 


67322 


F 


9.16 


Putative ribonucleotide diphosphate reductase 


71/79 (90) 


3e-44 


YP_0043062671 












n 1 1 ita roHnvi n ciiKiini+[/ ^/^fr^rTM^/^iic: ffiQAQl 
y 1 u la 1 cuuAi i i ouuu i n l[ lcju lwLiL/kjL'Uj Mt'ch-cjj 








123 


67315 


67686 


F 


14.1 


Putative ribonucleotide reductase stimulatory 


121/123 (98) 


2e-79 


YP_004306268.1 












protein [Lactococcus <1>949] 








124 


67700 


69889 


F 


82.8 


Ribonucleotide-diphosphate reductase subunit 


465/715 (65) 


0.0 


WP_0033230971 












alpha [Bacillus alcalophilus] 








125 


69902 


70312 


F 


16.0 


Hypothetical protein TTHERM_00313410 


19/46 (41) 


0.48 


XP_001033522.1 












[ Tetrahymena thermophila] 
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ORF 


Start 


Stop 


F/R 


MW(kDa) 


Best match 


%ID 


E-value 


Accession number"^ 


126 


70395 


70991 


F 


23.0 


Hypothetical group 1 intron protein [Lactococcus 


89/126 (71) 


3e-60 


YP_004306264.1 












*949] 








127 


71101 


71751 


F 


25.7 


Hypothetical protein IL. lactis ssp. cremoris A76] 


89/208 (43) 


2e-49 


YP_005876546.1 


128 


71773 


72513 


F 


29.3 


ATPase ISolibacillus silvestris StLB0461 


70/229 (31) 


8e-18 


YP_006461 965.1 


129 


72726 


72872 


F 


5.74 


Unnamed protein product [Tetraodon nigroviridis] 


16/40 (40) 


1.2 


CAG05661.1 Ji^H 


130 


72869 


73153 


F 


10.7 


M23 peptidase domain-containing protein [C. 


29/83 (35) 


0.46 


EIA163371 












perfringens F262] 








131 


73169 


73390 


F 


8.71 


Hypothetical protein LP030nr2_047 [Listeria 


19/37 (51) 


3e-04 


YP_0081 26743.1 












*LP-030-2] 








132 


73392 


73787 


F 


15.5 


Uncharacterized protein BN765_00487 


27/91 (30) 


3.1 


CDA41513.1 H 












lEubacterium eligens CAG:72] 






1 


133 


73797 


74150 


F 


13.8 


Hypothetical protein CRE_21750 ICaenorhabditis 


25/79 (32) 


1.2 


XP_0031053871 ■ 












remanei] 








134 


74153 


74335 


F 


7.15 


Hypothetical protein GY4MC1_0607 [Geobacillus 


15/48 (31) 


1.5 


YP_003988046.1 ■ 












sp. Y4.1MC11 






M 


135 


74404 


75114 


F 


27.3 


Hypothetical protein [Bacillus amyloliquefaciens 


85/238 (36) 


3e-36 


YP_005541 036.1 












TA208] 








136 


75475 


76098 


F 


22.3 


Diphosphate reductase [Streptococcus ovis] 


131/184 (71) 


5e-88 


WP_018378302.1 


137 


76177 


76746 


F 


17.0 


Hypothetical protein [Lachnospiraceae bacterium 
A41 


35/142 (25) 


0.92 


WP_016282214.1 


138 


76752 


77021 


F 


22.0 


Hypothetical protein llmg_2104 [L. lactis ssp. 


27/ 90 (30) 


0.008 


YP_001 033358.1 












cremoris MG1363] 








139 


77047 


77322 


F 


10.6 


Hypothetical protein [Clostridium sp. 7_2_43FAA] 


24/84 (29) 


72 


WP_008676880.lB 


140 


77312 


78226 


F 


10.6 


Putative ribonucleotide diphosphate reductase 


278/ 298 (93) 


0.0 


YP_004306269.1 












alpha subunit [Lactococcus <1>949]] 








141 


78305 


78985 


F 


32.9 


Putative Guanylate kinase [Lactococcus <1>949] 


198/226 (88) 


6e-144 


YP_004306270.1 


142 


78963 


79706 


F 


27.8 


Putative deoxyuridine 5'-triphosphate 


239/247 (97) 


1e-176 


YP_004306271.1 












nucleotidohydrolase [Lactococcus <1>949] 








143 


79803 


80564 


F 


28.1 


Putative protease [Lactococcus <I>949] 


227/ 253 (90) 


1e-167 


YP_004306272.1 


147 


82174 


85336 


F 


122.3 


Putative DNA polymerase III alpha subunit 


641/692 (93) 


0.0 


YP_004306276.1 












[Lactococcus <l>949|-truncated 








148 


85420 


85596 


F 


65.1 


Putative uncharacterized protein [Odoribacter sp. 


14/32 (44) 


9.4 


WP_021989169J^ 




















153 


87639 


86959 


R 


26.1 


Putative Holliday junction resolvase [Lactococcus 


208/ 226 (92) 


2e-130 


YP_004306283.1 












cpy4yj 








1 54 


88100 


87693 


D 

n 


14.7 


Putative phage holin [Lactococcus 0949] 


11 O / IOC IO~7\ 

\ lo/ loo (o/J 


1 e-77 


Yr_uu4ouozo4. I 


1 57 


yu4 1 o 


88586 


D 

n 


65.9 


Putative tail protein [Lactococcus 0949] 


/ICO / CI C /"7C\ 

4do/ d Id \/oi 


0.0 


VD nn/ioncoo"7i 
Yr_UU4oUOZo/. 1 


1 OO 


yucsy 1 


on /1 1 D 

yu4 1 o 


p 

n 


1 /.O 


Putative tail protein [Lactococcus $949] 


1 C"7 /QC\ 

loU/ 10/ lat)) 


oe-yo 


VD nri/iQncoQQ 1 
T r_UU4oUDZoo. 1 




y 1 /DO 


yuyjz 


p 

n 


o 1 .4 


Putative tail protein [Lactococcus $949] 


1 Q/i lion i^o\ 


ze-/o 


VD nri/iQncoQQ 1 
Y r_UU4oUDZoo. 1 


160 


92858 


91731 


R 


41 .8 


Host-specificity [Lactococcus <l>blL286] 


136/228 (60) 


3e-67 


NP_076691.1 


161 


93009 


92836 


R 


6.4 


Orf56 [Lactococcus 't>blL286] 


52/57 (91) 


1 e-29 


NP_076690.1 


162 


93704 


93009 


R 


27.4 


Orf55 [Lactococcus 'i>blL286] 


209/228 (92) 


4e-141 


NP_076689.1 


163 


94137 


93751 


R 


13.2 


Orf55 [Lactococcus *blL286] 


120/128 (94) 


5e-62 


NP_076689.1 


164 


98313 


94153 


R 


156.2 


Putative tail protein [Lactococcus <1>949] 


1088/1309 (83) 


0.0 


YP_004306288.1 


165 


99184 


98420 


R 


28.6 


Putative phage tail component [Lactococcus 4)949] 


236/ 254 (93) 


3e-169 


YP_004306289.1 


166 


109063 


99236 


R 


357.7 


Putative tail tape measure protein [Lactococcus 


2780 /3276 (85) 


0.0 


YP_004306290.1 












*949] 








167 


110073 


109081 


R 


38.4 


Putative site specific recombinase [Lactococcus 


326/ 330 (99) 


0.0 


YP_004306291 1 












*949] 








168 


110552 


110070 


R 


18.7 


Putative phage structural protein [Lactococcus 


153/ 160 (96) 


7e-111 


YP_004306292.1 












*949] 








172 


112890 


112189 


R 


25.1 


Putative phage structural protein [Lactococcus 


197/234 (84) 


4e-139 


YP_004306296.1 



'i>949] 

(Continued) 
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Table 3 | Continued 



ORF 


Start 


Stop 


F/R 


MW(kDa) 


Best match 


%ID 


E-value 


Accession number*^ 


174 


114260 


113655 


R 


22.9 


Putative plnage structural protein [Lactococcus 
<t>949] 


186/201 (93) 


6e-136 


YP_ 


_004306298.1 


175 


115012 


114260 


R 


28.5 


Putative plnage structural protein [Lactococcus 
#949] 


240/ 250 (96) 


2e172 


YP_ 


_004306299.1 


177 


115749 


115309 


R 


16.9 


Putative phage structural protein [Lactococcus 
<S>9A9] 


143/146 (98) 


3e-99 


YP_ 


.004306301.1 


179 


117513 


116494 


R 


38.3 


Putative phage structural protein [Lactococcus 
*949] 


318/338 (94) 


0.0 


YP_ 


_004306303.1 


180 


118088 


117552 


R 


19.3 


Putative phage structural protein [Lactococcus 
<1>949] 


125/178 (70) 


1e-82 


VP 


_UU40UD0U^. 1 


181 


119358 


118117 


R 


46.9 


Putative phage structural protein [Lactococcus 
#949] 


401/417 (96) 


0.0 


YP_ 


_004306305.1 


182 


120911 


119370 


p 


58 1 


Pi itati\/o nhano ctri ir^ti iral nmtoin [/ ar^f/^ti^rtr'r'iic: 
ruiciLivt; [Ji layt! oliul-luicii [jiulcim [LcjutiyLiCUOUi!) 

<1>949] 


490/513 /Q6^ 


0 0 


YP_ 


_004306306.1 


183 


122632 


120935 


R 


64.9 


Putative terminase ATPase subunit [Lactococcus 
<1>949] 


558/565 (99) 


0.0 


YP_ 


_0043063071 


185 


125008 


123761 


R 


47.6 


Putative DNA polymerase [Lactococcus <t>949] 


397/415 (96) 


0.0 


YP_ 


_004306309.1 


186 


126342 


125017 


R 


50.9 


Putative phosphate starvation-inducible protein 
PhoH/ATPase [Lactococcus *949] 


337/441 (76) 


0.0 


YP_ 


_004306310.1 



"Hypothetical proteins witli >60% identity to <P949 liave been omitted. 

^ORFs stiaded gray possess E-values > 7. 

'^Accession numbers corresponding to the NCBI database. 

Abbreviations: F/R, Forward/ Reverse; MW, Molecular v^^eight; %ID, % Identity; E-value, Expect-value. 



recognition sites, that offer an advantage over other bacterio- 
phages. Group I introns are frequently associated with endonu- 
cleases (Chevalier and Stoddard, 2001), however, ORFs 118 and 
126 were both found to encode this type of intervening sequence 
(TVS) between ribonucleotide reductase proteins. Previously, two 
introns were found within the ribonucleotide reductase large sub- 
unit of Staphylococcus aureus OTwort (Landthaler et al., 2002). In 
Escherichia coli <I>T4, the aerobic Ribonucleotide reductase small 
subunit and the anaerobic ribonucleotide reductase were both 
found to possess group I introns (Sjoberg et al, 1986; Young 
et al., 1994). One explanation offered for the retention of introns 
in genes, such as those involved in ribonucleotide reduction, is 
that they encode functions replicated in the host organism that 
can be utilized for phage survival until such time as the insertion 
element and host environment adapt to one another (Derbyshire 
and Belfort, 1998). 

The acquisition of ribonucleotide reductases (RNRs) is 
thought to arise from the host genome, to enable the adapta- 
tion to certain environmental conditions (Dwivedi et al, 2013). 
These enzymes function in DNA replication and repair via the 
conversion of ribonucleotides to deoxyribonucleotides (Dwivedi 
et al., 2013). The gene products of 7 ORFs were found to 
encode ribonucleotide di- and tri-phosphate reductases. ORF 116 
encodes a putative anaerobic ribonucleoside-triphosphate reduc- 
tase with a 25% identity to a putative anaerobic ribonucleoside- 
triphosphate reductase of Clostridium <t'phi8074-Bl. A class 
III ribonucleotide reductase domain cd01675 was identified 
within this gene product which uses a FeS cluster and S- 
adenosylmethionine to generate a glycyl radical. ORF 124 shares 
a 65% identity with the ribonucleotide-diphosphate reductase 



subunit alpha of Bacillus alcalophilus and contains a class I RNR 
domain cd01679. In contrast to class III, a class I RNR domain 
uses a di-iron-tyrosyl radical. The presence of class I and class 
III RNR genes in <1>L47 conforms with the RNR complement 
generally found in phages isolated from sewage (Dwivedi et al., 
2013). Other ribonucleotide reductases shared a large degree of 
homology with <I>949 as well as other genes involved in nucleotide 
transport, modification and degradation. 

tRNA ENCODING GENES IN L47 GENOME 

The role of tRNAs in phage genomes is thought to be the encoding 
of codons that are less frequent in the host genome, thus allow- 
ing for the increased expression of phage proteins (Bailly-Bechet 
et al, 2007). In all, 8 tRNA genes (Table 4) were identified over 
a small region of the genome (48,524-53,674 bp) in <t>L47, using 
the tRNA-scan SE and Aragorn software (Lowe and Eddy, 1997; 
Laslett and Canback, 2004). tRNAs had a %GC content ranging 
from 39.4 to 50.7. Between ORFs 106 and 107, were positioned 
tRNA^s, tRNA'^'^P and tRNA'^^ while between ORFs 107 and 108 
were tRNA'^™, tRNA^" and tRNA^'P. A 58 bp intron, from posi- 
tion 34 to 35, was identified in tRNA^'=''. Two tRNA'^'=* genes were 
identified, the first situated between ORFs 101 and 102, and the 
second between ORFs 107 and 108. The possession of more than 
one tRNA for a specific amino acid is a phenomenon that has 
been observed in Lactobacillus plantarum bacteriophage <t>LP65 
(Chibani-Chennoufi et al., 2004b). The acquisition of tRNA genes 
from the host is proposed as being a random event and the genes 
are either retained, via a set of selection mechanisms, or they 
are lost (Bailly-Bechet et al, 2007). It is conceivable that cI>L47 
has acquired two tRNA^*^* in order to gain a fitness advantage. 
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FIGURE 4 I Genome organization of tlie lactococcal phage *L47. Each arrow represents an open reading franne (ORF), the orientation of which shows the 
direction of transcription. The predicted function of each ORF was determined by bioinformatic analyses (Carver et al., 2005). 



due to the relatively harsh environment that it inhabits. In-situ 
burst sizes for organisms in a nutrient-poor environment, such 
as that from which the host was isolated, are generally smaller 
than when the host is infected in a nutrient rich, chemically 
defined media (Chibani-Chennoufi et al., 2004a). The retention 
of particular tRNA genes in phage genomes are thought to cor- 
respond to codons that are less abundant in the host genome 
(Bailly-Bechet et al., 2007) although this was not observed for 
<I>949 and its host L. lactis IL1403 (Samson and Moineau, 2010). 
Therefore, we may only hypothesize that these tRNA genes are 
associated with controlling phage protein production and are 
possibly involved in increasing reproduction rate and reducing 
latency time (Bailly-Bechet et al, 2007). 

The arrangement of the tRNAs into distinct blocks suggests 
that they were obtained through separate recombination events 
with either host DNA, other phages or a combination of both 
(Weinbauer, 2004). The presence of tRNAs has been found to 



Table 4 | tRNA arrangement in $L47. 



tRNA 


Amino acid 


Anti-codon 


Size (bp) 


%GC 


Start 


End 


1 


Met 


CAT 


73 


47.9 


48524 


48596 


2 


Arg 


TCT 


73 


39.7 


50343 


50415 


3 


Asp 


GTC 


75 


50.7 


51213 


51287 


4 


Ala 


CGC 


71 


50.7 


51639 


51709 


5 


Ser 


ACT 


92 


43.5 


51967 


52116 


6 


Asn 


ATT 


76 


39.5 


52830 


52905 


7 


Trp 


CCA 


71 


39.4 


53114 


53184 


8 


Met 


CAT 


74 


55.4 


53400 


53473 



be particularly common in phages with a larger genome size 
(Bailly-Bechet et al., 2007). Six tRNA genes were identified in 
(^949, the most identified in a lactococcal bacteriophage (Samson 
and Moineau, 2010). Similarly, in $Lb338-l and <I>K, both 
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with a genome larger than 100 kb, 2 and 4 tRNAs were iden- 
tified, respectively (O'Flaherty et al, 2004; Alemayehu et al., 
2009). No tRNA encoding regions were found in 2 recently elu- 
cidated lactococcal phage genomes, belonging to a new P335 
group, with sizes of ~31kb (Mahony et al., 2013). However, 
5 tRNA genes have been identified in <I>P087, a P087 species, 
of a smaller genome size (60,074 bp) than <I>949 and <t>L47 
(Villion et al, 2009). 

COMPARATIVE GENOME ANALYSIS 

Numerous ORFs in the L47 genome were found to show iden- 
tity to putative proteins of other lactococcal phages. ORFs 27-30 
encode hypothetical proteins belonging to different lactococcal 
phages (Table 3). No conserved domains were detected within 
these ORFs. ORF 119, showing 39% identity with Lactococcus 
<I>KSY1, was found to encode a pfam01139 domain, a conserved 
domain of the uncharacterized protein family UPF0027. Putative 
proteins expressed by Enterococcus <I>phiEF24C and Enterococcus 
faecium are also found to be similar to those identified in the 
<I'L47 genome (3 ORFs; >40% identity). Previously, lactococcal 
OP087 was shown to possess genes with a high degree of sim- 
ilarity to structural genes of a prophage of Enterococcus faecalis 
V538 (Villion et al., 2009). Of note, ORF 23 was found to be 
highly similar to a phage protein of the plant Lactococcus strain, 
KF147. Upstream from the structural genes, ORF 130 encodes a 
M23 peptidase domain containing protein similar to Clostridium 
perfringens F262 (Nowell et al., 2012). The position of this gene 



with genes involved in DNA replication, suggests that it is not tail 
associated as in Lactococcus phages Tuc2009 and TP901-1 (Seegers 
et al, 2004; Stockdale et al., 2013). This peptidase would possibly 
hydrolyse peptidoglycan via D-Ala-D-Asp endopeptidase activ- 
ity enabling the penetration of stationary phase lactococcal cells 
(Samson et al., 2013). 

Comparative genomic analysis shows the high similarity at a 
nucleotide level between the 0147 and <t>949 phages (Figure 5). 
In addition, given the close amino acid identity at protein level 
across the genome (Table 3), suggesting a number of shared func- 
tions, it is likely that these phages have shared a common ancestor 
at some point. Studies of the more common lactococcal phage 
groups have indicated a large degree of homology between mem- 
bers of the c2 and 936 species while the P335 species displays a 
polythetic species theory (Deveau et al., 2006). Although much 
discussion surrounds phage evolution, in relation to virulent 
phages vertical lines of evolution are believed to be crucial in 
the development of certain phage families (Brussow and Kutter, 
2005) and are particularly evident in dairy phages (Briissow and 
Desiere, 2001). However, horizontal genetic exchange also plays 
a key role in the evolution of phages as exemplified by genetic 
mosaicism formed from a high occurrence of transfer events 
(Canchaya et al., 2003). Genetic exchange between bacteria and 
other phages from distinct niches is hypothesized to be pivotal 
in the evolution of rare lactococcal phages (Fortier et al., 2006; 
Chopin et al., 2007; Garneau et al, 2008; ViUion et al, 2009). 
Due to the high degree of sequence homology between these two 




FIGURE 5 I BLASTn-based alignment (nucleotide identity >90% 
shown) of the genome sequences of L. lactis phage 4>L47 
and 4>949 as displayed by the Artemis comparison tool (ACT). 

Red lines between genomes indicate ortlnologs in tine same 



orientation. Coding sequences are color coded as described in 
Figure 4; structural (blue), hypothetical (green), DNA replication 
(yellow), ribonucleotide reductase (red), endolysin-holin (mauve), 
unknown (gray). 
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phages, it is expected that <I>L47 is another member of the 949 
group. Analysis of <I> 1 1 1 revealed this lactococcal phage to possess 
a genome size of ~ 134 kb and a long tail of 470 nm (Prevots et al., 
1990). However, without comparative genome analysis it can only 
be speculated that <I> 1 11 is a representative of the 949 group. 

CONCLUDING REMARKS 

To date, numerous studies have investigated dairy lactococ- 
cal phages and how they interact with their respective hosts. 
Industrial Lactococcus strains, used in modern cheese production, 
are thought to have evolved from plant strains (Kelly et al., 2010). 
<t>L47, isolated from a non-dairy environment, possessed signif- 
icant similarity to the rare, dairy phage 0949, with both phages 
isolated from different environments almost 40 years apart. <1>L47 
was found to possess a number of features which differentiate it 
from <I)949, most notably a long tail fiber, not previously reported 
in phages of Lactococcus. This tail fiber may play an important role 
in enabling (I>L47 to infect L. lactis DPC6860, which was largely 
resistant to dairy phages, and may account for the persistence of 
successful virulent phages in the wider environment as observed 
for $Q33 and 4)BM13 (Mahony et al, 2013). Further studies are 
required to establish the diversity of lactococcal bacteriophages 
from non-dairy origins and the similarity they may possess with 
dairy phages of other groups. This information could shed further 
light on the mechanisms they possess that allow them to thrive 
in harsher environments and may advance our understanding of 
host recognition and infection by lactococcal phages. 
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